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(57) Abstract: Blocks of audio are encoded based upon corresponding first and second fiiequencies. The first and second fi-equencies 

are hopped from block to block. An audio quality measure (AQM) is computed for each block of audio such that, if x out of y blocks 
^ of audio have an AQM greater than a first predetermined threshold, encoding is suspended. For example, x may be nine and y may 

be 1 6. Also, if a ratio of the energy in a front part of a bJock of audio to the energy in a rear part of the block of audio is greater than 
O a second predetermined threshold, that block of audio is not encoded even though x out of y blocks of audio have an AQM greater 
^ than the first predetermined threshold. Multiple distributors of the audio may encode the audio with their corresponding identities 

using the above processes. 
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SYSTEM AND METHOD FOR ENCODING AN AUDIO SIGNAL FOR USE IN BROADCAST PROGRAM 
IDENTIFICATION SYSTEMS, BY ADDING INAUDIBLE CODES TO THE AUDIO SIGNAL 

Related Application 

This application is a continuation-in-part of U.S. 
Patent Application Serial No. 09/116,397 filed July 16, 1998. 
This application also contains disclosure similar to the 
disclosure in U.S. Application Serial No. (28019/35519) ." 

Technical Field the ];^ve^tj,Qn 

The present invention relates to spectral audio 
encoding useful, for example, in modulating broadcast signals 
in order to add identifying codes thereto. 

Background of the Invention 

Several approaches to metering the video and/or 
audio tuned by television and/or radio receivers in order to 
determine the sources or identities of corresponding televi- 
sion or radio programs are known. For example, one approach 
is to real time correlate a program to which a receiver is 
tuned with each of the programs available to the receiver. An 
apparatus useful for this measurement approach is found in the 
teachings of Lu et al. in U.S. Patent No. 5,594,934. 

Another approach is to extract a characteristic 
signature (or a characteristic signature set) from the program 
selected for viewing and/or listening, and to compare the 
characteristic signature (or characteristic signature set) 
with reference signatures (or reference signature sets) col- 
lected from known transmission sources at a reference site. 
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Although the reference site could be the viewer's household, 
the reference site is usually at a location which is remote 
from the households of all of the viewers being monitored. 
Systems using signature extraction are taught by Lert and Lu 
5 in U.S. Patent No. 4,677,466 and by Kiewit and Lu in U.S. 
Patent No. 4,697,209. 

In signature extraction systems, audio characteris- 
tic signatures are often utilized. Typically, these charac- 
teristic signatures are extracted by a unit located at the 

10 monitored receiver, sometimes referred to as a site unit. The 
site unit monitors the audio output of a television or radio 
receiver either by means of a microphone that picks up the 
sound from the speakers of the monitored receiver or by means 
of an output line from the monitored receiver. The site unit 

15 extracts and transmits the characteristic signatures to a 

central household unit, sometimes referred to as a home unit. 
Each characteristic signature is designed to uniquely charac- 
terize the audio signal tuned by the receiver during the time 
of signature extraction. 

20 Characteristic signatures are typically transmitted 

from the home unit to a central office where a matching opera- 
tion is performed between the characteristic signatures and a 
set of reference signatures extracted at a reference site from 
all of the audio channels that could have been tuned by the 

25 receiver in the household being monitored. A matching score 
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is computed by a matching algorithm and is used to determine 
the identity of the program to which the monitored receiver 
was tuned or the program source (such as a broadcaster) of the 
tuned program. 

5 Yet another approach to metering video and/or audio 

tuned by televisions and/or radios is to add ancillary identi- 
fication codes to television and/or radio programs and to 
detect and decode the ancillary codes in order to identify the 
encoded programs or the corresponding program sources when the 

10 programs are tuned by monitored receivers. There are many 

arrangements for adding an ancillary code to a signal in such 
a way that the added code is not noticed. It is well known in 
television broadcasting, for example, to hide such ancillary 
codes in non-viewable portions of video by inserting them into 

15 either the video's vertical blanking interval or horizontal 
retrace interval . An exemplary system which hides codes in 
non- viewable portions of video is referred to as "AMOL" and is 
taught in U.S. Patent No. 4,025,851. This system is used by 
the assignee of this application for monitoring transmissions 

20 of television programming as well as the times of such trans- 
missions . 

Other known video encoding systems have sought to 
bury the ancillary code in a portion of a television signal's 

transmission bandwidth that otherwise carries little signal 
25 energy. An example of such a system is disclosed by Dougherty 
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in U.S. Patent No. 5,629,739, which is assigned to the as- 
signee of the present application. 

Other methods and systems add ancillary codes to 
audio signals for the purpose of identifying the signals and, 
5 perhaps, for tracing their courses through signal distribution 
systems. Such arrangements have the obvious advantage of 
being applicable not only to television, but also to radio • 
transmissions and to pre-recorded music. Moreover, ancillary 
codes which are added to audio signals may be reproduced in 

10 the audio signal output by a speaker. Accordingly, these 

arrangements offer the possibility of non- intrusively inter- 
cepting and decoding the codes with equipment that has a 
microphone as an input. In particular, these arrangements 
provide an approach to measuring program audiences by the use 

15 of portable metering equipment carried by panelists. 

One such audio encoding system is disclosed by 
Crosby, in U.S. Patent No. 3,845,391. In this system, a code 
is inserted in a narrow frequency "notch" from which the 
original audio signal is deleted. The notch is made at a 

20 fixed predetermined frequency (e.g., 40 Hz). This approach 

led to codes that were audible when the original audio signal 
containing the code was of low intensity. 

A series of improvements followed the Crosby patent. 
Thus, Howard, in U.S. Patent No. 4,703,476, teaches the use of 

25 two separate notch frequencies for the mark and the space 
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portions of a code signal. Kramer, in U.S. Patent No. 
4,931,871 and in U.S. Patent No. 4,945,412 teaches, inter 
alia, using a code signal having an amplitude that tracks the 
amplitude of the audio signal to which the code is added. 
5 Program audience measurement systems in which panel- 

ists are expected to carry microphone -equipped audio monitor- 
ing devices that can pick up and store inaudible codes trans- 
mitted in an audio signal are also known. For example, 
Aijalla et al., in WO 94/11989 and in U.S. Patent No. 

10 5,579,124, describe an arrangement in which spread spectrum 
techniques are used to add a code to an audio signal so that 
the code is either not perceptible, or can be heard only as 
low level "static" noise. Also, Jensen et al . , in U.S. Patent 
No. 5,450,490, teach an arrangement for adding a code at a 

15 fixed set of frequencies and using one of two masking signals, 
where the choice of masking signal is made on the basis of a 
frequency analysis of the audio signal to which the code is to 
be added, Jensen et al . do not teach a coding arrangement in 
which the code frequencies vary from block to block. The 

20 intensity of the code inserted by Jensen et al . is a predeter- 
mined fraction of a measured value (e.g., 30 dB down from peak 
intensity) rather than comprising relative maxima or minima. 

Moreover, Preuss et al . , in U.S. Patent No. 
5,319,735, teach a multi-band audio encoding arrangement in 

25 which a spread spectrum code is inserted in recorded music at 
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a fixed ratio to the input signal intensity (code-to-music 
ratio) that is preferably 19 dB. Lee et al . , in U.S. Patent 
No. 5,687,191, teach an audio coding arrangement suitable for 
use with digitized audio signals in which the code intensity 
5 is made to match the input signal by calculating a signal-to- 
mask ratio in each of several frequency bands and by then 
inserting the code at an intensity that is a predetermined 
ratio of the audio input in that band. As reported in this 
patent, Lee et al . have also described a method of embedding 

10 digital information in a digital waveform in pending U.S. 
application Serial No. 08/524,132. 

It will be recognized that, because ancillary codes 
are preferably inserted at low intensities in order to prevent 
the code from distracting a listener of program audio, such 

15 codes may be vulnerable to various signal processing opera- 
tions. For example, although Lee et al . discuss digitized 
audio signals, it may be noted that many of the earlier known 
approaches to encoding an audio signal are not compatible with 
current and proposed digital audio standards, particularly 

20 those employing signal compression methods that may reduce the 
signal's dynamic range (and thereby delete a low level code) 
or that otherwise may damage an ancillary code. In this 
regard, it is particularly important for an ancillary code to 
survive compression and subsequent de-compression by the AC-3 

25 algorithm or by one of the algorithms recommended in the 
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ISO/IEC 11172 MPEG standard, which is expected to be widely- 
used in future digital television transmission and reception 
systems . 

U.S. Patent Application Serial No. 09/116,397 filed 
5 July 16, 1998 discloses a system and method for inserting a 
code into an audio signal so that the code is likely to sur- 
vive compression and decompression as required by current and 
proposed digital audio standards. In this system and method, 
spectral modulation at selected code frequencies is used to 
10 insert the code into the audio signal. These code frequencies 
are varied from audio block to audio block, and the spectral 
modulation may be implemented as amplitude modulation, modula- 
tion by frequency swapping, phase modulation, and/or odd/even 
index modulation. 

15 In most audio signals of the type used in television 

systems, a code inserted by spectral modulation in accordance 
with the aforementioned patent application is substantially 
inaudible. However, there are some instances where the code 
may be undesirably audible. The present invention addresses 

20 one or more of these instances. The present application also 
addresses methods of multi-level coding. 



Summary of the Invention 

According to one aspect of the present invention, a 
method for encoding first and second blocks of audio with 
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corresponding first and second binary code bits comprises the 
following steps: a) selecting first and second frequencies 
from a frequency spectrum of the first block of audio; b) 
modulating the audio based upon the first and second frequen- 
5 cies to thereby encode the first block of audio with the first 
binary code bit; c) selecting third and fourth frequencies 
from a frequency spectrum of the second block of audio, 
wherein the third and fourth frequencies bear a predetermined 
offset relationship to the first and second frequencies; and, 

10 d) modulating the audio based upon the third and fourth fre- 
quencies to thereby encode the second block of audio with the 
second binary code bit . 

According to another aspect of the present inven- 
tion, a method for encoding a block of audio with a binary 

15 code bit comprises the following steps: a) selecting a fre- 
quency from a frequency spectrum of the block of audio; b) 
selectively amplifying an odd index frequency in a neighbor- 
hood of the selected frequency to be a local maximum if the 
block of audio is to be encoded with the binary code bit 

20 having a first value; and, c) selectively amplifying an even 
index frequency in a neighborhood of the selected frequency to 
be a local maximum if the block of audio is to be encoded with 
the binary code bit having a second value. 

According to still another aspect of the present 

25 invention, a method for encoding blocks of audio with binary 
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code bits comprises the following steps: a) determining an 
audio quality measure AQM for each block of audio; b) compar- 
ing the AQM corresponding to each block of audio to AQMthresh, 
wherein AQM^hresh is a predetermined audio quality measure 
5 reference; c) if AQM < AQMthresh ^ blocks of audio out of y 
blocks of audio, encoding the blocks of audio with binary 
bits, wherein x and y are corresponding predetermined numbers 
of blocks of audio; and, d) if AQM > AQMthresh for the x blocks 
of audio out of the y blocks of audio, suspending encoding of 

10 the blocks of audio. 

According to yet another aspect of the present 
invention, a method is provided to encode a block of audio 
with a binary code bit. The block of audio has an energy. 
The method comprises the following steps: a) determining a 

15 ratio E1/E2, wherein is the energy in a first portion of the 
block of audio, and wherein Ej is the energy in a second 
portion of the block of audio; b) modulating the block of 
audio with the binary code bit if E1/E2 > EpR^, wherein EpRg is 
a predetermined reference; and, c) not modulating the block 

20 of audio with the binary code bit if E1/E2 < EpRg. 

According to a further aspect of the present inven- 
tion, a method of encoding blocks of audio with binary code 
bits comprises the following steps: a) encoding each of the 
blocks of audio with a binary bit by modulating the audio 

25 within the corresponding block of audio at selected first and 
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second frequencies, wherein the selected first and second 
frequencies are hopped from block to block; and, b) executing 
step a) so as to indicate first and second levels of distribu- 
tion of the audio. 
5 According to a still further aspect of the present 

invention, a method for decoding first and second blocks of 
audio in order to recover corresponding first and second 
binary code bits therefrom comprises the following steps: a) 
detecting first and second frequencies from a frequency spec- 

10 trum of the first block of audio; b) demodulating the first 

and second frequencies in order to recover to the first binary 
code bit; c) detecting third and fourth frequencies from a 
frequency spectrum of the second block of audio, wherein the 
third and fourth frequencies bear a predetermined offset 

15 relationship to the first and second frequencies; and, d) 
demodulating the third and fourth frequencies in order to 
recover the second binary code bit . 

According to yet a further aspect of the present 
invention, a method of decoding blocks of audio encoded with 

20 binary code bits comprises the following steps: a) decoding 
each of the blocks of audio in order to recover a correspond- 
ing binary bit by demodulating the audio within the corre- 
sponding block of audio at selected first and second frequen- 
cies, wherein the selected first and second frequencies are 
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hopped from block to block; and, b) executing step a) so as 
identify first and second distributors of the audio. 

According to another aspect of the present inven- 
tion, a method of decoding a block of audio in order to re- 
5 cover a binary code bit therefrom comprises the following 

steps: a) detecting a frequency having an amplitude maximum 
within a selected frequency neighborhood of the block of 
audio; b) if the frequency detected in step a) corresponds to 
an odd frequency index, decoding the frequency as a binary 
10 code bit having a first value; and, c) if the frequency 

detected in step a) corresponds to an even frequency index, 
decoding the frequency as a binary code bit having a second 
value . 



Brief Description of the Drawing 
15 These and other features and advantages will become 

more apparent from a detailed consideration of the invention 
when taken in conjunction with the drawings in which: 

Figure 1 is a schematic block diagram of an audi- 
ence measurement system enploying the signal coding and decod- 
20 ing arrangements of the present invention; 

Figure 2 is flow chart depicting steps performed by 
an encoder of the system shown in Figure 1; 

Figure 3 is a spectral plot of an audio block, 
wherein the thin line of the plot is the spectrum of the 



- 11 - 



wo 01/31816 

1 



PCT/USOO/03829 



original audio signal and the thick line of the plot is the 
spectrum of the signal modulated in accordance with the pres- 
ent invention; 

Figure 4 depicts a window function which may be used 
5 to prevent transient effects that might otherwise occur at the 
boundaries between adjacent encoded blocks; 

Figure 5 is a schematic block diagram of an arrange- 
ment for generating a seven-bit pseudo-noise synchronization 
sequence ; 

10 Figure 6 is a spectral plot of a "triple tone" audio 

block which forms the first block of a preferred synchroniza- 
tion sequence, where the thin line of the plot is the spectrum 
of the original audio signal and the thick line of the plot is 
the spectrum of the modulated signal; 

15 Figure 7a schematically depicts an arrangement of 

synchronization and information blocks usable to form a com- 
plete code message; 

Figure 7b schematically depicts further details of 
the synchronization block shown in Fig. 7a; 

20 Figure 8 is a flow chart depicting steps performed 

by a decoder of the system shown in Figure 1; and, 

Figure 9 illustrates an encoding arrangement in 
which audio encoding delays are compensated in the video data 
stream. 
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Detailed Description of the Invention 

Audio signals are usually digitized at sampling 
rates that range between thirty- two kHz and forty-eight kHz. 
For example, a sampling rate of 44.1 kHz is commonly used 
5 during the digital recording of music. However, digital 

television ("DTV") is likely to use a forty eight kHz sampling 
rate. Besides the sampling rate, another parameter of inter- 
est in digitizing an audio signal is the number of binary bits 
used to represent the audio signal at each of the instants 

10 when it is sampled. This number of binary bits can vary, for 
example, between sixteen and twenty four bits per sample. The 
amplitude dynamic range resulting from using sixteen bits per 
sample of the audio signal is ninety- six dB. This decibel 
measure is the ratio between the square of the highest audio 

15 amplitude {2'^^ = 65536) and the lowest audio amplitude (1^ = 
1) . The dynamic range resulting from using twenty- four bits 
per sample is 144 dB. Raw audio, which is sampled at the 44.1 
kHz rate and which is converted to a sixteen-bit per sample 
representation, results in a data rate of 705.6 kbits/s. 

20 Compression of audio signals is performed in order 

to reduce this data rate to a level which makes it possible to 
• transmit a stereo pair of such data on a channel with a 
throughput as low as 192 kbits/s. This compression typically 
is accomplished by transform coding. A block consisting of 

25 = 1024 samples, for example, may be decomposed, by application 
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of a Fast Fourier Transform or other similar frequency analy- 
sis process, into a spectral representation. In order to 
prevent errors that may occur at the boundary between one 
block and the previous or subsequent block, overlapped blocks 
5 are commonly used. In one such arrangement where 1024 samples 
per overlapped block are used, a block includes 512 samples of 
"old" samples (i.e., samples from a previous block ) and 512 
samples of "new" or current samples. The spectral representa- 
tion of such a block is divided into critical bands where each 

10 band comprises a group of several neighboring frequencies. 

The power in each of these bands can be calculated by summing 
the squares of the amplitudes of the frequency components 
within the band. 

Audio compression is based on the principle of 

15 masking that, in the presence of high spectral energy at one 
frequency (i.e., the masking frequency), the human ear is 
unable to perceive a lower energy signal if the lower energy 
signal has a frequency (i.e., the masked frequency) near that 
of the higher energy signal. The lower energy signal at the 

20 masked frequency is called a masked signal. A masking thresh- 
old, which represents either (i) the acoustic energy required 
at the masked frequency in order to make it audible or (ii) an 
energy change in the existing spectral value that would be 
perceptible, can be dynamically computed for each band. The 

25 frequency components in a masked band can be represented in a 
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coarse fashion by using fewer bits based on this masking 
threshold. That is, the masking thresholds and the amplitudes 
of the frequency components in each band are coded with a 
smaller number of bits which constitute the compressed audio. 
5 Decompression reconstructs the original signal based on this 
data. 

Figure 1 illustrates an audience measurement system 
10 in which an encoder 12 adds an ancillary code to an audio 
signal portion 14 of a program signal to be transmitted. 

10 Alternatively, the encoder 12 may be provided, as is known in 
the art, at some other location in the program signal distri- 
bution chain. A transmitter 16 transmits the encoded audio 
signal portion with a video signal portion 18 of the program 
signal. When the encoded signal is received by a receiver 20 

15 located at a statistically selected metering site 22, the 
ancillary code is recovered by processing the audio signal 
portion of the received program signal even though the pres- 
ence of that ancillary code is imperceptible to a listener 
when the encoded audio signal portion is supplied to speakers 

20 24 of the receiver 20, To this end, a decoder 26 is connected 
either directly to an audio output 28 available at the re- 
ceiver 20 or to a microphone 30 placed in the vicinity of the 
speakers 24 through which the audio is reproduced. The re- 
ceived audio signal can be either in a monaural or stereo 

25 format . 
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ENCODING BY SPECTRAL MODULATION 
In order for the encoder 12 to embed a digital code 
in an audio data stream in a manner compatible with compres- 
sion technology, the encoder 12 should preferably use frequen- 
5 cies and critical bands that match those used in compression. 
The block length Nc of the audio signal that is used for 
coding may be chosen such that, for example, jNq = N^ = 1024, 
where j is an integer. A suitable value for Nc may be, for 
example, 512. As depicted by a step 40 of the flow chart 

10 shown in Figure 2, which is executed by the encoder 12, a 
first block v(t) of samples is derived from the audio 
signal portion 14 by the encoder 12 such as by use of an 
analog to digital' converter, where v(t) is the time-domain 
representation of the audio signal within the block. An 

15 optional window may be applied to v(t) at a block 42 as dis- 
cussed below in additional detail. Assuming for the moment 
that no such window is used, a Fourier Transform S{v(t) } of 
the block v(t) to be coded is computed at a step 44. (The 
Fourier Transform implemented at the step 44 may be a Fast 

20 Fourier Transform.) 

The frequencies resulting from the Fourier Transform 
are indexed in the range -256 to +255, where an index of 255 
corresponds to exactly half the sampling frequency fg. There- 
fore, for a forty-eight kHz sampling frequency, the highest 

25 index would correspond to a frequency of twenty- four kHz. 
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Accordingly, for purposes of this indexing, the index closest 
to a particular frequency component f j resulting from the 
Fourier Transform S{v(t)} is given by the following equation: 

- (1) 

where equation (1) is used in the following discussion to 
relate a frequency f j and its corresponding index Ij . 

The code frequencies fi used for coding a block may 
be chosen from the Fourier Transform S{v(t)) at a step 46 in 
the 4.8 kHz to 6 kHz range in order to exploit the higher 
auditory threshold in this band. Also, each successive bit of 
the code may use a different pair of code frequencies f^ and fo 
denoted by corresponding code frequency indexes Ii and Iq. 
There are two preferred ways of selecting the code frequencies 
fi and fo at the step 46 so as to create an inaudible wide -band 
noise like code. 

(a) Direct Sequence 
One way of selecting the code frequencies f^ and fo 
at the step 46 is to compute the code frequencies by use of a 

frequency hopping algorithm employing a hop sequence Hg and a 
shift index Ighift- For example, if Ng bits are grouped to- 
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gether to form a pseudo-noise sequence, Hg is an ordered 
sequence of Ng numbers representing the frequency deviation 
relative to a predetermined reference index 1^^, For the case 
where Ng = 7, a hop sequence Hg = {2,5,1,4,3,2,5} and a shift 
5 index Ighift = 5, for example, could be used. In general, the 
indices for the Ng bits resulting from a hop sequence may be 
given by the following equations: 



(2) 



4 = hk + f^s * hhiff (3) 

One possible choice for the reference frequency fgj^ is five 
10 kHz, for example, which corresponds to a predetermined refer- 
ence index Igj^ = 53 . This value of t^^ is chosen because it is 
above the average maximum sensitivity frequency of the human 
ear. When encoding a first block of the audio signal, and 
Iq for the first block are determined from equations (2) and 
15 (3) using a first of the hop sequence numbers; when encoding 
a second block of the audio signal, and !„ for the second 
block are determined from equations (2) and (3) using a second 
of the hop sequence numbers; and so on. For the fifth bit in 
the sequence {2,5,1,4,3,2,5}, for example, the hop sequence 
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value is three and, using equations (2) and (3) , produces an 
index = 51 and an index Iq = 61 in the case where Ighift = ^^ 
In this example, the mid- frequency index is given by the 
following equation: 



5 where I^i^ represents an index mid-way between the code fre- 
quency indices Ii and Iq. Accordingly, each of the code fre- 
quency indices is offset from the mid-frequency index by the 
same magnitude, Ighiff the two offsets have opposite signs. 

(b) Hopping based on low frequency maximum 
10 Another way of selecting the code frequencies at the 

step 46 is to determine a frequency index 1^^^^ at which the 
spectral power of the audio signal, as determined as the step 
44, is a maximum in the low frequency band extending from zero 
Hz to two kHz. In other words, I^ is the index corresponding 
15 to the frequency having maximum power in the range of 0 - 2 
kHz. It is useful to perform this calculation starting at 
index 1, because index 0 represents the "local" DC component 
and may be modified by high pass filters used in compression. 
The code frequency indices I^ and Ig are chosen relative to the 
20 frequency index I^^^ so that they lie in a higher frequency 
band at which the human ear is relatively less sensitive. 
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Again, one possible choice for the reference frequency f^y^ is 
five kHz corresponding to a reference index Ig^ = 53 such that 
Ii and Iq are given by the following equations: 



-'o ~ -^5* * 4iax ^shift (6) 

5 where Ighift is a shift index, and where varies according 

to the spectral power of the audio signal. An important 
observation here is that a different set of code frequency 
indices Ii and Iq from input block to input block is selected 
for spectral modulation depending on the frequency index 

10 of the corresponding input block. In this case, a code bit is 
coded as a single bit: however, the frequencies that are used 
to encode each bit hop from block to block. 

Unlike many traditional coding methods, such as 
Frequency Shift Keying (FSK) or Phase Shift Keying (PSK) , the 

15 present invention does not rely on a single fixed frequency. 
Accordingly, a "frequency -hopping" effect is created similar 
to that seen in spread spectrum modulation systems. However, 
unlike spread spectrum, the object of varying the coding 



-20- 



wo 01/31816 



PCT/USOO/03829 



frequencies of the present invention is to avoid the use of a 
constant code frequency which may render it audible. 

For either of the two code frequencies selection 
approaches (a) and (b) described above, there are at least 
5 four modulation methods that can be implemented at a step 56 
in order to encode a binary bit of data in an audio block, 
i.e., amplitude modulation, modulation by frequency swapping, 
phase modulation, and odd/even index modulation. These four 
methods of modulation are separately described below, 

10 (i) Amplitude Modulation 

In order to code a binary '1' using amplitude modu- 
lation, the spectral power at is increased to a level such 
that it constitutes a maximum in its corresponding neighbor- 
hood of frequencies. The neighborhood of indices correspond- 

15 ing to this neighborhood of frequencies is analyzed at a step 
48 in order to determine how much the code frequencies f^ and 
fo must be boosted and attenuated, respectively, so that they 
are detectable by the decoder 26. For index I^, the neighbor- 
hood may preferably extend from - 2 to + 2, and is con- 

20 strained to cover a narrow enough range of frequencies that 

the neighborhood of does not overlap the neighborhood of 1q. 
Simultaneously, the spectral power at Iq is modified in order 
to make it a minimum in its neighborhood of indices ranging 
from Iq - 2 to Iq + 2. Conversely, in order to code a binary 



-21 - 



wo 01/31816 



PCTAJSOO/03829 



'0' using amplitude modulation, the power at Iq is boosted and 
the power at is attenuated in their corresponding neighbor- 
hoods . 

As an example, Figure 3 shows a typical spectrum 50 
5 of a Nc sample audio block plotted over a range of frequency- 
index from forty five to seventy seven, A spectrum 52 shows 
the audio block after coding of a '1' bit, and a spectrum 54 
shows the audio block before coding. In this particular 
instance of encoding a ' 1 ' bit according to code frequency 

10 selection approach (a) , the hop sequence value is five which 
yields a mid-frequency index of fifty eight. The values for 
Ii and Iq are fifty three and sixty three, respectively. The 
spectral amplitude at fifty three is then modified at a step 
56 of Figure 2 in order to make it a maximum within its neigh- 

15 borhood of indices. The amplitude at sixty three already 

constitutes a minimum and, therefore, only a small additional 
attenuation is applied at the step 56. 

The spectral power modification process requires the 
computation of four values each in the neighborhood of I^ and 

20 Iq. For the neighborhood of Ii these four values are as fol- 
lows: (1) Imaxi which is the index of the frequency in the 
neighborhood of having maximum power; (2) P^axi which is the 
spectral power at In^xi; (3) Imini which is the index of the 
frequency in the neighborhood of Ii having minimum power; and 
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(4) Pmini which is the spectral power at Imini- Corresponding 
values for the Iq neighborhood are I^xc PmaxO' IminO' and P^ino- 
If ^maxi = ^1' SJ^^ if the binary value to be coded is 
a '1, ' only a token increase in Pn^xi (i.e., the power at I^) is 
5 required at the step 56. Similarly, if In,ino = ^o> then only a 
token decrease in Pmaxo (i-©-/ the power at Iq) is required at 
the step 56. When P^axi is boosted, it is multiplied by a 
factor 1 + A at the step 56, where A is in the range of about 
1.5 to about 2.0. The choice of A is based on experimental 

10 audibility tests combined with compression survivability 
tests. The condition for imperceptibility requires a low 
value for A, whereas the condition for compression survivabil- 
ity requires a large value for A. A fixed value of A may not 
lend itself to only a token increase or decrease of power. 

15 Therefore, a more logical choice for A would be a value based 
on the local masking threshold. In this case, A is variable, 
and coding can be achieved with a minimal incremental power 
level change and yet survive compression. 

In either case, the spectral power at I^ is given by 

20 the following equation: 



Pi; = 0 - AyP,„„^, (7) 

with suitable modification of the real and imaginary parts of 
the frequency component at I^. The real and imaginary parts 
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are multiplied by the same factor in order to keep the phase 
angle constant. The power at Iq is reduced to a value corre- 
sponding to (1 + A) P^ino in a similar fashion. 

The Fourier Transform of the block to be coded as 
5 determined at the step 44 also contains negative frequency- 
components with indices ranging in index values from -256 to - 
1. Spectral amplitudes at frequency indices and -Iq must 
be set to values representing the complex conjugate of ampli- 
tudes at Ii and Iq, respectively, according to the following 
10 equations: 



Rel/i-I,)] = ReWi)] (8) 
/m[/(-/,)] = -/«[/(/,)] (9) 

ReUi-Io)] = ReWo)] (lO) 



fm\A-Io>] = -J"^Wo)] (II) 

where f(I) is the complex spectral amplitude at index I. 

Compression algorithms based on the effect of mask- 
ing modify the amplitude of individual spectral components by 
means of a bit allocation algorithm. Frequency bands sub- 

15 j acted to a high level of masking by the presence of high 
spectral energies in neighboring bands are assigned fewer 
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bits, with the result that their amplitudes are coarsely 
quantized. However, the decompressed audio under most condi- 
tions tends to maintain relative amplitude levels at frequen- 
cies within a neighborhood. The selected frequencies in the 
5 encoded audio stream which have been amplified or attenuated 
at the step 56 will, therefore, maintain their relative posi- 
tions even after a compression/decompression process. 

It may happen that the Fourier Transform S{v(t) } of 
a block may not result in a frequency component of sufficient 

10 amplitude at the frequencies fj and fo to permit encoding of a 
bit by boosting the power at the appropriate frequency. In 
this event, it is preferable not to encode this block and to 
instead encode a subsequent block where the power of the 
signal at the frequencies fj and fo is appropriate for encod- 

15 ing. 

(ii) Modulation by Frequency Swapping 
In this approach, which is a variation of the ampli- 
tude modulation approach described above in section (i) , the 
spectral amplitudes at I^ and Inaxi swapped when encoding a 

20 one bit while retaining the original phase angles at and 

Itnaxi- A similar swap between the spectral amplitudes at Iq and 
^maxo is also performed. When encoding a zero bit, the roles 
of Ii and Iq are reversed as in the case of amplitude modula- 
tion. As in the previous case, swapping is also applied to 
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the corresponding negative frequency indices. This encoding 
approach results in a lower audibility level because the 
encoded signal undergoes only a minor frequency distortion. 
Both the unencoded and encoded signals have identical energy 
5 values. 

(iii) Phase Modulation 
The phase angle associated with a spectral component 
Iq is given by the following equation: 



where 0 s $0 ^ 2n. The phase angle associated with can be 
10 computed in a similar fashion. In order to encode a binary 
number, the phase angle of one of these components, usually 
the component with the lower spectral amplitude, can be modi- 
fied to be either in phase (i.e., 0°) or out of phase (i.e., 
180°) with respect to the other component, which becomes the 
15 reference. In this manner, a binary 0 may be encoded as an 
in-phase modification and a binary 1 encoded as an out-of- 
phase modification. Alternatively, a binary 1 may be encoded 
as an in-phase modification and a binary 0 encoded as an out- 
of -phase modification. The phase angle of the component that 
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is modified is designated <p^, and the phase angle of the other 
component is designated <^^. Choosing the lower amplitude 
component to be the modifiable spectral component minimizes 
the change in the original audio signal . 
5 In order to accomplish this form of modulation, one 

of the spectral components may have to undergo a maximum phase 
change of 180°, which could make the code audible. In prac- 
tice, however, it is not essential to perform phase modulation 
to this extent, as it is only necessary to ensure that the two 

10 components are either "close" to one another in phase or "far" 
apart. Therefore, at the step 48, a phase neighborhood ex- 
tending over a range of ±n/4 around 0r, the reference compo- 
nent, and another neighborhood extending over a range of ±n/4 
around cfiR + n may be chosen. The modifiable spectral compo- 

15 nent has its phase angle (I)^ modified at the step 56 so as to 
fall into one of these phase neighborhoods depending upon 
whether a binary '0' or a binary '1' is being encoded. If a 
modifiable spectral component is already in the appropriate 
phase neighborhood, no phase modification may be necessary. 

20 In typical audio streams, approximately 30% of the segments 

are "self-coded" in this manner and no modulation is required. 
The inverse Fourier Transform is determined at the step 62. 
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(iv) Odd/Even Index Modulation 
In this odd/even index modulation approach, a single 
code frequency index, l^, selected as in the case of the other 
modulation schemes, is used. A neighborhood defined by in- 
5 dexes I^, + 1, I^ + 2, and Ii + 3, is analyzed to determine 
whether the index I„ corresponding to the spectral component 
having the maximum power in this neighborhood is odd or even. 
If the bit to be encoded is a '1' and the index I„ is odd, then 
the block being coded is assumed to be "auto-coded." Other- 

10 wise, an odd- indexed frequency in the neighborhood is selected 
for amplification in order to make it a maximum. A bit '0' is 
coded in a similar manner using an even index. In the neigh- 
borhood consisting of four indexes, the probability that the 
parity of the index of the frequency with maximum spectral 

15 power will match that required for coding the appropriate bit 
value is 0.25. Therefore, 25% of the blocks, on an average, 
would be auto-coded. This type of coding will significantly 
decrease code audibility. 

A practical problem associated with block coding by 

20 either amplitude or phase modulation of the type described 
above is that large discontinuities in the audio signal can 
arise at a boundary between successive blocks. These sharp 
transitions can render the code audible. In order to elimi- 
nate these sharp transitions, the time-domain signal v{t) can 

25 be multiplied by a smooth envelope or window function w{t) at 
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the step 42 prior to performing the Fourier Transform at the 
step 44. No window function is required for the modulation by 
frequency swapping approach described herein. The frequency 
distortion is usually small enough to produce only minor edge 
discontinuities in the time domain between adjacent blocks. 

The window function w(t) is depicted in Figure 4. 
Therefore, the analysis performed at the step 54 is limited to 
the central section of the block resulting from Sn,{v{t)w(t) } . 
The required spectral modulation is implemented at the step 56 
on the transform S{v{t)w(t)}. 

The modified frequency spectrum which now contains 
the binary code (either '0' or '1') is subjected to an inverse 
transform operation at a step 62 in order to obtain the en- 
coded time domain signal, as will be discussed below. Follow- 
ing the step 62, the coded time domain signal is determined at 
a step 64 according to the following equation: 

VoW = v(0 + (S;J(v(rMO) - v(Ow(0) (13) 

where the first part of the right hand side of equation (13) 
is the original audio signal v(t) , where the second part of 
the right hand side of equation (13) is the encoding, and 
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where the left hand side of equation (13) is the resulting 
encoded audio signal VQ(t). 

While individual bits can be coded by the method 
described thus far, practical decoding of digital data also 
5 requires (i) synchronization, so as to locate the start of 

data, and (ii) built-in error correction, so as to provide for 
reliable data reception. The raw bit error rate resulting 
from coding by spectral modulation is high and can typically 
reach a value of 20%. In the presence of such error rates, 

10 both synchronization and error-correction may be achieved by 
using pseudo-noise (PN) sequences of ones and zeroes. A PN 
sequence can be generated, for example, by using an m- stage 
shift register 58 (where m is three in the case of Figure 5) 
and an exclusive-OR gate 60 as shown in Figure 5. For conve- 

15 nience, an n-bit PN sequence is referred to herein as a PNn 

sequence. For an Npj, bit PN sequence, an m-stage shift regis- 
ter is required operating according to the following equation: 

^PN = 2"- - 1 (14) 

where m is an integer. With m = 3, for example, the 7-bit PN 
sequence (PN7) is 1110100. The particular sequence depends 
20 upon an initial setting of the shift register 58. In one 

robust version of the encoder 12, each individual bit of data 
is represented by this PN sequence - i.e., 1110100 is used 
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for a bit '1, ' and the complement 0001011 is used for a bit 
'0. ' The use of seven bits to code each bit of code results 
in extremely high coding overheads. 

An alternative method uses a plurality of PN15 
5 sequences, each of which includes five bits of code data and 
10 appended error correction bits. This representation pro- 
vides a Hamming distance of 7 between any two 5 -bit code data 
words. Up to three errors in 'a fifteen bit sequence can be 
detected and corrected. This PN15 sequence is ideally suited 

10 for a channel with a raw bit error rate of 20%, 

In terms of synchronization, a unique synchroniza- 
tion sequence 66 (Figure 7a) is required for synchronization 
in order to distinguish PN15 code bit sequences 74 from other 
bit sequences in the coded data stream. In a preferred em- 

15 bodiment shown in Figure 7b, the first code block of the 

synchronization sequence 66 uses a "triple tone" 70 of the 
synchronization sequence in which three frequencies with 
indices Iq, Ij, and Imi^ are all amplified sufficiently that 
each becomes a maximum in its respective neighborhood, as 

20 depicted by way of example in Figure 6. It will be noted 

that, although it is preferred to generate the triple tone 70 
by amplifying the signals at the three selected frequencies to 
be relative maxima in their respective frequency neighbor- 
hoods, those signals could instead be locally attenuated so 

25 that the three associated local extreme values comprise three 
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local minima. It should be noted that any combination of 
local maxima and local minima could be used for the triple 
tone 70. However, because program audio signals include 
substantial periods of silence, the preferred approach in- 
5 volves local amplification rather than local attenuation. 

Being the first bit in a sequence, the hop sequence value for 
the block from which the triple tone 70 is derived is two and 
the mid-frequency index is fifty-five. In order to make the 
triple tone block truly unique, a shift index of seven may be 

10 chosen instead of the usual five. The three indices Iq, Ii, 
and I^iid whose amplitudes are all amplified are forty- eight, 
sixty- two and fifty- five as shown in Figure 6. (In this 
example, 1^^ = Hg + 53 = 2 + 53 = 55 . ) The triple tone 70 is 
the first block of the fifteen block sequence 66 and essen- 

15 tially represents one bit of synchronization data. The re- 
maining fourteen blocks of the synchronization sequence 66 are 
made up of two PN7 sequences: 1110100, 0001011. This makes 
the fifteen synchronization blocks distinct from all the PN 
sequences representing code data. 

20 As stated earlier, the code data to be transmitted 

is converted into five bit groups, each of which is repre- 
sented by a PN15 sequence. As shown in Figure 7a, an 
unencoded block 72 is inserted between each successive pair of 
PN sequences 74. During decoding, this unencoded block 72 (or 

25 gap) between neighboring PN sequences 74 allows precise syn- 
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chronizing by perinitting a search for a correlation maximum 
across a range of audio samples. 

In the case of stereo signals, the left and right 
channels are encoded with identical digital data. In the case 
5 of mono signals, the left and right channels are combined to 
produce a single audio signal stream. Because the frequencies 
selected for modulation are identical in both channels, the 
resulting monophonic sound is also expected to have the de- 
sired spectral characteristics so that, when decoded, the same 
10 digital code is recovered. 



DECODING THE SPECTRALLY MODULATED SIGNAL 
In most instances, the embedded digital code can be 
recovered from the audio signal available at the audio output 
28 of the receiver 20. Alternatively, or where the receiver 

15 20 does not have an audio output 28, an analog signal can be 
reproduced by means of the microphone 30 placed in the vicin- 
ity of the speakers 24. In the case where the microphone 30 
is used, or in the case where the signal on the audio output 
28 is analog, the decoder 20 converts the analog audio to a 

20 sampled digital output stream at a preferred sampling rate 
matching the sampling rate of the encoder 12. In decoding 
systems where there are limitations in terms of memory and 
computing power, a half-rate sampling could be used. In the 
case of half-rate sampling, each code block would consist of 
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=2 56 samples, and the resolution in the frequency domain 
(i.e., the frequency difference between successive spectral 
components) would remain the same as in the full sampling rate 
case. In the case where the receiver 20 provides digital 
5 outputs, the digital outputs are processed directly by the 
decoder 26 without sampling but at a data rate suitable for 
the decoder 26. 

The task of decoding is primarily one of matching 
the decoded data bits with those of a PN15 sequence which 

10 could be either a synchronization sequence or a code data 

sequence representing one or more code data bits. The case of 
amplitude modulated audio blocks is considered here. However, 
decoding of phase modulated blocks is virtually identical, 
except for the spectral analysis, which would compare phase 

15 angles rather than amplitude distributions, and decoding of 
index modulated blocks would similarly analyze the parity of 
the frequency index with maximum power in the specified neigh- 
borhood. Audio blocks encoded by frequency swapping can also 
be decoded by the same process. 

20 In a practical implementation of audio decoding, 

such as may be used in a home audience metering system, the 
ability to decode an audio stream in real-time is highly 
desirable. It is also highly desirable to transmit the de- 
coded data to a central office. The decoder 2 6 may be ar- 

25 ranged to run the decoding algorithm described below on Digi- 
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tal Signal Processing (DSP) based hardware typically used in 
such applications. As disclosed above, the incoming encoded 
audio signal may be made available to the decoder 26 from 
either the audio output 28 or from the microphone 30 placed in 
5 the vicinity of the speakers 24 . In order to increase pro- 
cessing speed and reduce memory requirements, the decoder 26 
may sample the incoming encoded audio signal at half (24 kHz) 
of the normal 48 kHz sampling rate. 

Before recovering the actual data bits representing 

10 code information, it is necessary to locate. the synchroniza- 
tion sequence. In order to search for the synchronization 
sequence within an incoming audio stream, blocks of 256 sam- 
ples, each consisting of the most recently received sam.ple and 
the 255 prior samples, could be analyzed. For real-time 

15 operation, this analysis, which includes computing the Fast 

Fourier Transform of the 256 sample block, has to be corrpleted 
before the arrival of the next sample. Performing a 256-point 
Fast Fourier Transform on a 40 MHZ DSP processor takes about 
600 microseconds. However, the time between samples is only 

20 40 microseconds, making real time processing of the incoming 

coded audio signal as described above impractical with current 
hardware . 

Therefore, instead of computing a normal Fast Fou- 
rier Transform on each 256 sample block, the decoder 2 6 may be 
25 arranged to achieve real-time decoding by implementing an 
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incremental or sliding Fast Fourier Transform routine 100 
(Figure 8) coupled with the use of a status information array 
SIS that is continuously updated as processing progresses. 
This array comprises p elements SIS[0] to SIS[p-l]. If p = 
5 64, for example, the elements in the status information array 
SIS are SIS[0] to SIS [63]. 

Moreover, unlike a conventional transform which 
computes the complete spectrum consisting of 256 frequency 
"bins," the decoder 26 computes the spectral amplitude only at 

10 frequency indexes that belong to the neighborhoods of inter- 
est, i.e., the neighborhoods used by the encoder 12. In a 
typical example, frequency indexes ranging from 45 to 70 are 
adequate so that- the corresponding frequency spectrum contains 
only twenty- six frequency bins. Any code that is recovered 

15 appears in one or more elements of the status information 
array SIS as soon as the end of a message block is encoun- 
tered. 

Additionally, it is noted that the frequency spec- 
trum as analyzed by a Fast Fourier Transform typically changes 
20 very little over a small number of samples of an audio stream. 

Therefore, instead of processing each block of 256 samples 
consisting of one "new" sample and 255 "old" samples, 256 
sample blocks may be processed such that, in each block of 256 
samples to be processed, the last k samples are "new" and the 
25 remaining 256-k samples are from a previous analysis. In the 
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case where ic = 4 , processing speed may be increased by skip- 
ping through the audio stream in four sample increments, where 
a skip factor k is defined as = 4 to account for this opera- 
tion. 

5 Each element SIS [p] of the status information array 

SIS consists of five members: a previous condition status 
PCS, a next jump index JI, a group counter GC, a raw data 
array DA, and an output data array OP. The raw data array DA 
has the capacity to hold fifteen integers. The output data 

10 array OP stores. ten integers, with each integer of the output 
data array OP corresponding to a five bit number extracted 
from a recovered PN15 sequence. This PN15 sequence, accord- 
ingly, has five actual data bits and ten other bits. These 
other bits may be used, for example, for error correction. It 

15 is assumed here that the useful data in a message block con- 
sists of 50 bits divided into 10 groups with each group con- 
taining 5 bits, although a message block of any size may be 
used. 

The operation of the status information array SIS is 
20 best explained in connection with Figure 8 . An initial block 
of 256 samples of received audio is read into a buffer at a 
processing stage 102. The initial block of 256 samples is 
analyzed at a processing stage 104 by a conventional Fast 
Fourier Transform to obtain its spectral power distribution. 
25 All subsequent transforms implemented by the routine 100 use 
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the high-speed incremental approach referred to above and 
described below. 

In order to first locate the synchronization se- 
quence, the Fast Fourier Transform corresponding to the ini- 
5 tial 2 56 sample block read at the processing stage 102 is 
tested at a processing stage 106 for a triple tone, which 
represents the first bit in the synchronization sequence. The 
presence of a triple tone may be determined by examining the 
initial 256 sample block for the indices Iq, Ij, and Imid used 

10 by the encoder 12 in generating the triple tone, as described 
above. The SIS [p] element of the SIS array that is associated 
with this initial block of 256 samples is SIS[0], where the 
status array index p is equal to 0 . If a triple tone is found 
at the processing stage 106, the values of certain members of 

15 the SIS[0] element of the status information array SIS are 
changed at a processing stage 108 as follows: the previous 
condition status PCS, which is initially set to 0, is changed 
to a 1 indicating that a triple tone was found in the sample 
block corresponding to SIS[0]; the value of the next jump 

20 index JI is incremented to 1; and, the first integer of the 
raw data member DA[0] in the raw data array DA is set to the 
value (0 or 1) of the triple tone. In this case, the first 
integer of the raw data member DA[0] in the raw data array DA 
is set to 1 because it is assumed in this analysis that the 

25 triple tone is the equivalent of a 1 bit. Also, the status 
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array index p is incremented by one for the next sample block. 
If there is no triple tone, none of these changes in the 
SIS [0] element are made at the processing stage 108, but the 
status array index p is still incremented by one for the next 
5 sample block. Whether or not a triple tone is detected in 

this 256 sample block, the routine 100 enters an incremental 
FFT mode at a processing stage 110. 



read into the buffer at a processing stage 112 by adding four 
10 new samples to, and discarding the four oldest samples from, 
the initial 256 sample block processed at the processing 
stages 102 - 106. This new 256 sample block increment is 
analyzed at a processing stage 114 according to the following 
steps : 

15 STEP 1: the skip factor k of the Fourier Transform is applied 
according to the following equation in order to modify each 
frequency component Foi^dJo) of the spectrum corresponding to 
the initial sample block in order to derive a corresponding 
intermediate frequency component FiCuq): 



Accordingly, a new 256 sample block increment is 



^.K) = ^o/>o)exp-(— f ) 




(15) 
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where Ug is the frequency index of interest. In accordance 
with the typical example described above, the frequency index 
Uq varies from 45 to 70. It should be noted that this first 
step involves multiplication of two complex numbers. 

5 STEP 2 ; the effect of the first four samples of the old 256 
sample block is then eliminated from each Fi(uo) of the spec- 
trum corresponding to the initial sample block and the effect 
of the four new samples is included in each Fi(uo) of the 
spectrum corresponding to the current sample block increment 
10 in order to obtain the new spectral amplitude Fne„{Uo) for each 
frequency index Uq according to the following equation: 

PnM = ^i(«o) ^ E (/;.» -/./»)exp-( %- -) (16) 

m = \ 256 



where fd^ and fne„ are the time-domain sample values. It 
should be noted that this second step involves the addition of 
a complex number to the summation of a product of a real 

15 number and a complex number. This computation is repeated 

across the frequency index range of interest (for example, 45 
to 70) . 
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STEP 3 : the effect of the multiplication of the 256 sample 
block by the window function in the encoder 12 is then taken 
into account. That is, the results of step 2 above are not 
confined by the window function that is used in the encoder 
5 12. Therefore, the results of step 2 preferably should be 

multiplied by this window function. Because multiplication in 
the time domain is equivalent to a convolution of the spectrum 
by the Fourier Transform of the window function, the results 
from the second step may be convolved with the window func- 
10 tion. In this case, the preferred window function for this 

operation is the following well known "raised cosine" function 
which has a narrow 3 -index spectrum with amplitudes (-0.50, 1, 
+0.50): 

wit) = 1[1 - C0S(^)] (17) 

where T„ is the width of the window in the time domain. This 
15 "raised cosine" function requires only three multiplication 

and addition operations involving the real and imaginary parts 
of the spectral amplitude. This operation significantly 
improves computational speed. This step is not required for 
the case of modulation by frequency swapping. 
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STEP 4 : the spectrum resulting from step 3 is then examined 
for the presence of a triple tone. If a triple tone is found, 
the values of certain members of the SIS[1] element of the 
status information array SIS are set at a processing stage 116 
5 as follows: the previous condition status PCS, which is 

initially set to 0, is changed to a 1; the value of the next 
jump index JI is incremented to 1; and, the first integer of 
the raw data member DA[1] in the raw data array DA is set to 
1. Also, the status array index p is incremented by one. If 
10 there is no triple tone, none of these changes are made to the 
members of the structure of the SIS [1] element at the process- 
ing stage 116, but the status array index p is still incre- 
mented by one. 



15 processing stage 118 and the group counter GC has not accumu- 
lated a count of 10 as determined at a processing stage 120, 
this analysis corresponding to the processing stages 112 - 120 

proceeds in the manner described above in four sample incre- 
ments where p is incremented for each sample increment. When 

20 SIS [63] is reached where p = 64, p is reset to 0 at the pro- 
cessing stage 118 and the 256 sample block increment now in 
the buffer is exactly 256 samples away from the location in 
the audio stream at which the SIS[0] element was last updated. 
Each time p reaches 64, the SIS array represented by the 

25 SIS[0] - SIS [63] elements is examined to determine whether the 



Because p is not yet equal to 64 as determined at a 
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previous condition status PCS of any of these elements is one 
indicating a triple tone. If the previous condition status 
PCS of any of these elements corresponding to the current 64 
sample block increments is not one, the processing stages 112 
5 - 120 are repeated for the next 64 block increments. (Each 
block increment comprises 256 samples.) 

Once the previous condition status PCS is equal to 1 
for any of the SIS[0] - SIS [63] elements corresponding to any 
set of 64 sample block increments, and the corresponding raw 
10 data member DA[p] is set to the value of the triple tone bit, 
the next 64 block increments are analyzed at the processing 
stages 112 - 120 for the next bit in the synchronization 
sequence . 

Each of the new block increments beginning where p 
15 was reset to 0 is analyzed for the next bit in the synchroni- 
zation sequence. This analysis uses the second member of the 
hop sequence Hg because the next jump index JI is equal to 1. 
From this hop sequence number and the shift index used in 
encoding, the and Iq indexes can be determined, for example 
20 from equations (2) and (3) . Then, the neighborhoods of the I^ 
and Iq indexes are analyzed to locate maximums and minimums in 
the case of amplitude modulation. If, for example, a power 
maximum at I^ and a power minimum at Iq are detected, the next 
bit in the synchronization sequence is taken to be 1. In 
25 order to allow for some variations in the signal that may 
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arise due to compression or other forms of distortion, the 
index for either the maximum power or minimum power in a 
neighborhood is allowed to deviate by 1 from its expected 
value. For example, if a power maximum is found in the index 
5 Ii, and if the power minimum in the index Iq neighborhood is 

found at Iq - 1, instead of 1q, the next bit in the synchroni- 
zation sequence is still taken to be 1. On the other hand, if 
a power minimum at and a power maximum at Ig are detected 
using the same allowable variations discussed above, the next 

10 bit in the synchronization sequence is taken to be 0. How- 
ever, if none of these conditions are satisfied, the output 
code is set to -1, indicating a sample block that cannot be 
decoded. Assuming that a 0 bit or a 1 bit is found, the 
second integer of the raw data member DA[1] in the raw data 

15 array DA is set to the appropriate value, and the next jump 
index JI of SIS[0] is incremented to 2, which corresponds to 
the third member of the hop sequence Hg. From this hop se- 
quence number and the shift index used in encoding, the and 
Iq indexes can be determined. Then, the neighborhoods of the 

20 Ii and Iq indexes are analyzed to locate maximums and minimums 
in the case of amplitude modulation so that the value of the 
next bit can be decoded from the third set of 64 block increj- 
ments, and so on for fifteen such bits of the synchronization 
sequence. The fifteen bits stored in the raw data array DA 

25 may then be compared with a reference synchronization sequence 
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to determine synchronization. If the number of errors between 
the fifteen bits stored in the raw data array DA and the 
reference synchronization sequence exceeds a previously set 
threshold, the extracted sequence is not acceptable as a 
5 synchronization, and the search for the synchronization se- 
quence begins anew with a search for a triple tone. 

If a valid synchronization sequence is thus de- 
tected, there is a valid synchronization, and the PN15 data 
sequences may then be extracted using the same analysis as is 

10 used for the synchronization sequence, except that detection 
of each PN15 data sequence is not conditioned upon detection 
of the triple tone which is reserved for the synchronization 
sequence. As each bit of a PN15 data sequence is found, it is 
inserted as a corresponding integer of the raw data array DA. 

15 When all integers of the raw data array DA are filled, (i) 

these integers are compared to each of the thirty- two possible 
PN15 sequences, (ii) the best matching sequence indicates 
which 5-bit number to select for writing into the appropriate 
array location of the output data array OP, and (iii) the 

20 group counter GC member is incremented to indicate that the 

first PN15 data sequence has been successfully extracted. If 
the group counter GC has not yet been incremented to 10 as 
determined at the processing stage 120, program flow returns 
to the processing stage 112 in order to decode the next PN15 

25 data sequence. 
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When the group counter GC has incremented to 10 as 
determined at the processing stage 120, the output data array 
OP, which contains a full 50-bit message, is read at a pro- 
cessing stage 122 . The total number of samples in a message 
5 block is 45,056 at a half-rate sampling frequency of 24 kHz. 
It is possible that several adjacent elements of the status 
information array SIS, each representing a message block 
separated by four samples from its neighbor, may lead to the 
recovery of the same message because synchronization may occur 

10 at several locations in the audio stream which are close to 

one another. If all these messages are identical, there is a 
high probability that an error- free code has been received. 

Once a message has been recovered and the message 
has been read at the processing stage 122, the previous condi- 

15 tion status PCS of the corresponding SIS element is set to 0 
at a processing stage 124 so that searching is resumed at a 
processing stage 126 for the triple tone of the synchroniza- 
tion sequence of the next message block. 

MULTI -LEVEL CODING 

20 Often there is a need to insert more than one code 

message into the same audio stream. For example in a televi- 
sion program distribution environment, the network originator 

of the program may insert its identification code and time 
stamp, and a network affiliated station carrying this program 
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may also insert its own identification code. In addition, an 
advertiser or sponsor may wish to have its code added. It is 
noted that the network originator, the network affiliated 
station, and the advertiser are at different distribution 
5 levels between audio origination and audio reception by the 
consumer. There are a number of methods of accommodating 
multi-level encoding in order to designate more than one 
distributor of the audio. 



(i) Bit Reservation 

10 In order to accommodate multi-level coding, 48 bits 

in a 50 -bit system can be used for the code and the remaining 
2 bits can be used for level specification. Usually the first 
program material generator, say the network, will insert codes 
in the audio stream. Its first message block would have the 

15 level bits set to 00, and only a synchronization sequence and 
the 2 level bits are set for the second and third message 
blocks in the case of a three level system. For example, the 
level bits for the second and third messages may be both set 
to 11 indicating that the actual data areas have been left 

20 unused . 

The network affiliated station can now enter its 
code with a decoder/encoder combination that would locate the 
synchronization of the second message block with the 11 level 
setting. This station inserts its code in the data area of 
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this block and sets the level bits to 01. The next level 
encoder inserts its code in the third message block's data 
area and sets the level bits to 10. During decoding, the 
level bits distinguish each message level category. 

5 (ii) Frequency Multiplexing 

In frequency multiplexing", each code level (e.g., 
network, affiliate, advertiser) is assigned to a different 
frequency band in the spectrum. In determining the size of a 
frequency band and, therefore, the number of bands that may be 

10 coded, it is noted that each code level generally requires a 
minimum of eighteen consecutive spectral lines when using the 
coding methods described herein. This requirement follows 
from the way in which a triple tone is coded. That is, in 
coding a triple tone, the frequencies corresponding to indices 

15 Ii, Iq, and I^id are all amplified. Because Ii = forty-eight 

and Iq = sixty- two, the two outer frequencies corresponding to 
Ii and Iq are separated by fourteen spectral lines. In addi- 
tion, the neighborhoods defined for these frequencies extend 
two spectral lines on either side of these two frequencies for 

20 a total' of eighteen spectral lines. 

At a sampling rate of 48 kHz and 512 samples per 
block, eighteen spectral lines correspond to a spectral width 
of 1.69 kHz. In order to insert a code, there must be enough 
energy within this 1.69 kHz band to provide masking for the 
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code signal. Three levels of code can be inserted in an audio 
signal typically having a bandwidth of 8 kHz by choosing the 
following bands: 2.9 kHz to 4.6 kHz for a first level of 
coding; 4.6 kHz to 6 . 3 kHz for a second level of coding; 
5 and, 6.3 kHz to 8,0 kHz for a third level of coding. However, 
it should be noted that audio consisting of speech usually has 
a bandwidth lower than 5 kHz and may, therefore, support only 
a single level of code. 



(iii) Primary/ Secondary Encoding 
10 In this method of encoding, two types of encoders, a 

primary encoder and one or more secondary encoders, may be 
used to insert different levels of code. The various levels 
of code can be arranged hierarchically in such a manner that 
the primary encoder inserts at least the synchronization 
15 sequence and may also insert one of the levels, such as the 

highest level, of code. During encoding, and preferably prior 
to insertion of the synchronization sequence, the primary 
encoder leaves a predetermined number of audio blocks uncoded 
to permit the secondary encoders to insert their assigned 
20 levels of code. Accordingly, the secondary encoders have the 
capability to both decode and encode audio such that they 
first locate the synchronization sequence inserted by the 
primary encoder, and then determine their assigned positions 
in the audio stream for insertion of their corresponding 
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codes. In the decoding process, the synchronization sequence 
is first detected, and then the several levels of codes are 
recovered sequentially. 

CODE ERASURE AND OVERWRITE 
5 It may also be necessary to provide a means of 

erasing a code or to erase and overwrite a code . Erasure may 
be accomplished by detecting the triple tone/synchronization 
sequence using a decoder and by then modifying at least one of 
the triple tone frequencies such that the code is no longer 

10 recoverable. Overwriting involves extracting the synchroniza- 
tion sequence in the audio, testing the data bits in the data 
area and inserting a new bit only in those blocks that do not 
have the desired bit value. The new bit is inserted by ampli- 
fying and attenuating appropriate frequencies in the data 

15 area. 

DELAY COMPENSATION 
In a practical implementation of the encoder 12, N^ 
samples of audio, where Nc is typically 512, are processed at 
any given time. In order to achieve operation with a minimum 
20 amount of throughput delay, the following four buffers are 

used: input buffers INO and INI, and output buffers OUTO and 
OUTl . Each of these buffers can hold N^ samples. While 
samples in the input buffer INO are being processed, the input 
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buffer INI receives new incoming samples. The processed 
output samples from the input buffer INO are written into the 
output buffer OUTO, and samples previously encoded are written 
to the output from the output buffer OUTl . When the operation 
associated with each of these buffers is completed, processing 
begins on the samples stored in the input buffer INI while the 
input buffer INO starts receiving new data. Data from the 
output buffer OUTO are now written to the output. This cycle 
of switching between the pair of buffers in the input and 
output sections of the encoder continues as long as new audio 
samples arrive for encoding. It is clear that a sample arriv- 
ing at the input suffers a delay equivalent to the time dura- 
tion required to fill two buffers at the sampling rate of 48 
kHz before its encoded version appears at the output. This 
delay is approximately 22 ms. When the encoder 12 is used in 
a television system environment, it is necessary to compensate 
for this delay in order to maintain synchronization between 
video and audio. 

Such a compensation arrangement is shown in Figure 
9. As shown in Figure 9, an encoding arrangement 200, which 
may be used for the elements 12, 14, and 18 in Figure 1, is 
arranged to receive either analog video and audio inputs or 
digital video and audio inputs. Analog video and audio inputs 
are supplied to corresponding video and audio analog to digi- 
tal converters 202 and 204. The audio samples from the audio 
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analog to digital converter 204 are provided to an audio 
encoder 206 which may be of known design or which may be 
arranged as disclosed above. The digital audio input is 
supplied directly to the audio encoder 206. Alternatively, if 
5 the input digital bit stream is a combination of digital video 
and audio bit stream portions, the input digital bit stream is 
provided to a demultiplexer 208 which separates the digital 
video and audio portions of the input digital bit stream and 
supplies the separated digital audio portion to the audio 

10 encoder 206. 

Because the audio encoder 206 imposes a delay on the 
digital audio bit stream as discussed above relative to the 
digital video bit stream, a delay 210 is introduced in the 
digital video bit stream. The delay imposed on the digital 

15 video bit stream by the delay 210 is equal to the delay im- 
posed on the digital audio bit stream by the audio encoder 
206. Accordingly, the digital video and audio bit streams 
downstream of the encoding arrangement 200 will be synchro- 
nized. 

20 In the case where analog video and audio inputs are 

provided to the encoding arrangement 200, the output of the 
delay 210 is provided to a video digital to analog converter 
212 and the output of the audio encoder 206 is provided to an 

audio digital to analog converter 214. In the case where 
25 separate digital video and audio bit streams are provided to 
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the encoding arrangement 200, the output of the delay 210 is 
provided directly as a digital video output of the encoding 
arrangement 200 and the output of the audio encoder 206 is 
provided directly as a digital audio output of the encoding 
5 arrangement 200. However, in the case where a combined digi- 
tal video and audio bit stream is provided to the encoding 
arrangement 200, the outputs of the delay 210 and of the audio 
encoder 206 are provided to a multiplexer 216 which recombines 
the digital video and audio bit streams as an output of the 

10 encoding arrangement 200. 

As explained above, there may be some instances 
where the arrangement described above can result in undesir- 
able audibility of the ancillary code inserted into a program 
audio signal . Two such instances and exemplary solutions to 

15 these two instances are described below. 



Controlling Code Audibility Using 
an Audio Quality Measure (AQM) 

One example of audio material that is difficult to 

inaudibly encode is instrumental music characterized by strong 

20 harmonics or by a strong fundamental frequency in the code 

frequency band. Shifting the frequency maxima and minima in 

such cases can lead to audible distortion. Therefore, an 
audibility score, which is designated herein as the audio 
quality measure (AQM) , can be computed in order to determine 
25 when instances of potentially audible code segments occur. 
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AQM computation may be based on psycho-acoustic 
models that are widely used in audio compression algorithms 
such as Dolby's AC-3, iyiPEG-2 Layers I, II, or III, or MPEG- 
AAC. The AQM computation discussed below is based on MPEG- 
5 AAC. However, the AQM computation may be based any of these 

audio compression algorithms. (For example, in the Dolby AC-3 
audio compression method, a Modified Discrete Cosine Transform 
(MDCT) spectrum is used for computing the masking levels.) 

Let it be assumed that blocks of 512 samples at a 48 
10 kHz sampling rate are used to compute the AQM. The frequency 
space extending from 0 to 24 kHz is divided into 42 critical 
bands. Prior to encoding a block of audio as described above, 
the spectral energy [b] in each critical band, where b is 
the band index, is computed by the encoder 12 at the step 48 
15 in accordance with the following equation: 



^ot^] = E^'w (18) 
f-fi 

where A[f] is the amplitude at a frequency component f in the 

corresponding critical band of the audio block, fi is the 
initial frequency component in the corresponding critical band 
of the audio block, and fl is the last frequency component in 
20 the corresponding critical band of the audio block. 
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A masking energy level Ef 



^MASK 



[b] is also computed at 



the step 48 following the methodology described in ISO/IEC 
13818-7:1997. The masking energy level Ep^K[b] is the minimum 
change in energy within the band b that will be perceptible to 
the human ear. 



modulation procedure described earlier in this application, a 
new energy level value Ec[b] for each band in the coded block 
will result and can be computed at the step 48 using equation 



change in energy of a band b given by |Ec[b] - Eg [b] | is less 
than the masking energy level E^gREb] . If |Ec[b] - Eo[b]| is 
less than E„;^gK[b], it can be assumed that there is adequate 
masking energy available in the band b to make the change 
resulting from coding imperceptible. Therefore, an aqmtb] for 
this band b is assumed to be zero. However, if |Ec[b] - Eq [b] j 
Ef^K[b] for the band b, the aqm for the band can be computed 
at the step 56 as follows: 



If this block were to be coded by the spectral 



(18) . 



The encoder 12 at the step 56 determines whether the 



aqm[b] = 



\EcJib]-E,[b]\ 



(19) 
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The total AQM score for the whole block, can be obtained at the 
step 56 from equation (19) by summing across all 42 critical 
bands according to the following equation: 

b=4\ 

^Q^WTAi = E oqm[b] (20) 
6=0 

If it is determined at the step 56 that AQMtqtal is greater than 
5 a predetermined threshold AQMthresh/ then the corresponding 
block is not considered to be suitable for encoding. 

In practice, however, coding of a single audio 
block, or even several audio blocks, whose AQMpor^ > AQMthresh 
and whose durations are each approximately 10 ms, may not 

10 result in an audible code. But if one such audio block oc- 
curs, it is likely to occur near in time to other such audio 
blocks with the result that, if a sufficient number of such 
audio blocks are grouped consecutively in a sequence, coding 
of one or more audio blocks in the sequence may well produce 

15 an audible code thereby degrading the quality of the original 
audio. 

Therefore, in order to determine when to encode and 
when to suspend encoding, the encoder 12 at the step 56 main- 
tains a count of audible blocks. If x out of y consecutive 
20 blocks prior to the current block fall in the audible code 



-56- 



wo 01/31816 



PCTAJSOO/03829 



category, then the encoder 12 at the step 56 suspends coding 
for all subsequent blocks of the current ancillary code mes- 
sage. If X is equal to 9 and y is equal to 16, for example, 
and if 9 out 16 such audio blocks are coded in spite of the 
5 audibility scores being high, an audible code is likely to 

result. Therefore, in order to successfully encode a 50 bit 
ancillary code message, a sequence of z audio blocks is re- 
quired, where the sequence of z audio blocks has less than x 
audible blocks in any consecutive y block segment. 

10 In addition, encoding of any individual audio block 

may be inhibited if the AQM score for this individual audio 
block exceeds a threshold AQMthresh+ which is set higher than 
AQMthresh- Even though a single bit of code may be accordingly 
lost in such a case, the error correction discussed above will 

15 make it possible to still recover the ancillary code message. 



Pre-echo Cancellation 
Pre-echo is a well known phenomenon that is encoun- 
tered in most or all block based audio processing operations 
such as compression. It also occurs in the case of audio 
20 encoding as described above. Pre-echo arises when the audio 
energy within a block is not uniformly distributed, but is 
instead concentrated in the latter half of the block. Pre- 
echo effects are most apparent in the extreme case when the 
first half of the audio block has a very low level of audio 



-57- 



wo 01/31816 



PCTAJSOO/03829 



and the second half of the audio block has a very high level 
of audio. As a result, a code signal, which is uniformly 
distributed across the entire audio block, has no masking 
energy available to make it inaudible during the first half of 
5 the audio block. 



step 56, is examined by the encoder 12 for the block's energy 
distribution characteristic. The energy in an audio block is 
computed by summing the squares of the amplitudes of the time 
10 domain samples. Then, if the ratio of the energy Ej in a 

first part of the audio block to the energy E2 in the remain- 
ing part of the audio block is below a threshold, a code is 
not inserted in the audio block. The energy and the energy 
E2 are calculated according to the following equations: 



Therefore, each audio block, prior to coding at the 



s=d 



(21) 



s=o 



15 



and 



s=S 



(22) 



s=d+\ 
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where A[s] is the amplitude of a sample s, S is the total 
number of samples in a corresponding block of audio, and d 
divides the corresponding block of audio between samples in 
the first part of the block of audio and samples in the re- 
5 maining part of the block of audio. For example, d may divide 
the block of audio between samples in the first quarter of the 
block of audio and samples in the last three quarters of the 
block of audio. 

Certain modifications of the present invention have 

10 been discussed above. Other modifications will occur to those 
practicing in the art of the present invention. For example, 
according to the description above, the encoding arrangement 
200 includes a delay 210 which imposes a delay on the video 
bit stream in order to compensate for the delay imposed on the 

15 audio bit stream by the audio encoder 206. However, some 
embodiments of the encoding arrangement 200 may include a 
video encoder 218, which may be of known design, in order to 
encode the video output of the video analog to digital con- 
verter 202, or the input digital video bit stream, or the 

20 output of the demultiplexer 208, as the case may be. When the 
video encoder 218 is used, the audio encoder 2 06 and/or the 
video encoder 218 may be adjusted so that the relative delay 
imposed on the audio and video bit streams is zero and so that 
the audio and video bit streams are thereby synchronized. In 

25 this case, the delay 210 is not necessary. Alternatively, the 
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delay 210 may be used to provide a suitable delay and may be 
inserted in either the video or audio processing so that the 
relative delay imposed on the audio and video bit streams is 
zero and so that the audio and video bit streams are thereby 
synchroni zed . 

In still other embodiments of the encoding arrange- 
ment 2 00, the video encoder 218 and not the audio encoder 2 06 
may be used. In this case, the delay 210 may be required in 
order to impose a delay on the audio bit stream so that the 
relative delay between the audio and video bit streams is zero 
and so that the audio and video bit streams are thereby syn- 
chronized. 

Accordingly, the description of the present inven- 
tion is to be construed as illustrative only and is for the 
purpose of teaching those skilled in the art the best mode of 
carrying out the invention. The details may be varied sub- 
stantially without departing from the spirit of the invention, 
and the exclusive use of all modifications which are within 
the scope of the appended claims is reserved. 
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WHAT IS CLAIMED IS: 



1 1. A method for encoding first and second blocks of 

2 audio with corresponding first and second binary code bits 

3 comprising the following steps: 

4 a) selecting first and second frequencies from a 

5 frequency spectrum of the first block of audio; 

6 b) modulating the audio based upon the first and 

7 second frequencies to thereby encode the first block of audio 

8 with the first binary code bit; 

9 c) selecting third and fourth frequencies from a 

10 frequency spectrum of the second block of audio, wherein the 

11 third and fourth frequencies bear a predetermined offset 

12 relationship to the first and second frequencies; and, 

13 d) modulating the audio based upon the third and 

14 fourth frequencies to thereby encode the second block of audio 

15 with the second binary code bit. 

1 2. The method of claim 1 wherein: 

2 step b) comprises the steps of (i) increasing the 

3 spectral power at one of the first and second frequencies so 

4 as to render the spectral power at the one of the first and 

5 second frequencies a local maximum, and (ii) decreasing the 

6 spectral power at the other of the first and second frequen- 
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7 cies so as to render the spectral power at the other of the 

8 first and second frequencies a local minimum; and, 

9 step d) comprises the steps of (i) increasing the 

10 spectral power at one of the third and fourth frequencies so 

11 as to render the spectral power at the one of the third and 

12 fourth frequencies a local maximum, and (ii) decreasing the 

13 spectral power at the other of the third and fourth frequen- 

14 cies so as to render the spectral power at. the other of the 

15 third and fourth frequencies a local minimum. 

1 3. The method of claim 1 wherein: 

2 step b) comprises the step of selectively changing a 

3 phase relationship between the first and second frequencies; 

4 and, 

5 step d) comprises the step of selectively changing a 

6 phase relationship between the third and fourth frequencies. 

1 4. The method of claim 1 wherein: 

2 step b) comprises the step of swapping a spectral 

3 amplitude of at least one of the first and second frequencies 

4 with a spectral amplitude of a frequency having a maximum 

5 amplitude in a frequency neighborhood of the least one of the 

6 first and second frequencies; and, 

7 Step d) comprises the step of swapping a spectral 

8 amplitude of at least one of the third and fourth frequencies 
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9 with a spectral amplitude of a frequency having a maximum 

10 amplitude in a frequency neighborhood of the at least one of 

11 the third and fourth frequencies. 

1 5. The method of claim 1 wherein the predetermined 

2 offset relationship is determined by a frequency hop sequence. 

1 6. The method of claim 1 wherein the predetermined 

2 offset relationship is determined by hopping based on a low 

3 frequency maximum. 

1 7. The method of claim 1 wherein the first and 

2 second frequencies are selected according to a reference 

3 frequency, a frequency hop sequence, and a predetermined shift 

4 index, and wherein the third and fourth frequencies are se- 

5 lected according to the reference frequency, the frequency hop 

6 sequence, and the predetermined shift index. 

1 8. The method of claim 7 wherein the first and 

2 second frequencies are selected according to the following 

3 equations: 
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5 wherein 1-^ and Iq are indexes corresponding to the first and 

6 second frequencies respectively, wherein I^y^ is an index corre- 

7 spending to the reference frequency, wherein Hg is a component 

8 of the frequency hop sequence, and wherein Ighift prede- 

9 termined shift index. 

1 9. The method of claim 1 wherein the first and 

2 second frequencies are selected according to a reference 

3 frequency, a first low frequency maximum, and a predetermined 

4 shift index, and wherein the third and fourth frequencies are 

5 selected according to the reference frequency, a second low 

6 frequency maximum, and the predetermined shift index. 

1 10. The method of claim 9 wherein the first and 

2 second frequencies are selected according to the following 

3 equations: 
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5 wherein and Iq are indexes corresponding to the first and 

6 second frequencies respectively, wherein Igj^ is an index corre- 

7 spending to the reference frequency, wherein I^^j^ is an index 

8 corresponding to a frequency at which the spectral power of 

9 the first block of audio is a maximum in a low frequency band 

10 of the first block of audio, and wherein Ighift is the predeter- 

11 mined shift index. 

1 11. The method of claim 1 wherein a synchronization 

2 block is added to the signal, and wherein the synchronization 

3 block is characterized by a triple tone portion. 

1 12 . The method of claim 1 further comprising the 

2 following steps: 

3 e) determining an audio quality measure AQM for the 

4 first block of audio; 

5 f) comparing AQM to AQMthresh' wherein AQMthresh is a 

6 predetermined threshold audio quality measure reference; and, 

7 g) if AQM > AQMthrbsh' inhibiting step b) . 

1 13. The method of claim 12 wherein the first block 

2 of audio has a spectral energy, wherein 
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3 wherein the first block of audio is divided into B frequency 

4 bands, wherein b is a variable representing a frequency band 

5 within the B frequency bands, wherein Eq [b] is the spectral 

6 energy in the first block of audio without coding, wherein 

7 Ec[b] is the spectral energy in the first block of audio with 

8 coding, and wherein 

b=B 

AQM = aqm[b] . 
b=0 



1 14 . The method of claim 1 further comprising the 

2 following steps: 

3 e) determining an audio quality measure AQM for 

4 each of a plurality of blocks of audio; 

5 f ) comparing the AQM corresponding to each of the 

6 plurality of blocks of audio to AQMthresh/ wherein AQMthresh is a 

7 predetermined threshold audio quality measure reference; and, 

8 g) if AQM > AQMthresh ^ blocks of audio out of y 

9 blocks of audio of the plurality of blocks of audio, encoding 

10 of the audio with binary bits is suspended, wherein x and y 

11 are corresponding predetermined numbers of blocks of audio. 

1 15. The method of claim 14 wherein y is a predeter- 

2 mined number of consecutive blocks of audio. 
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1 16 . The method of claim 14 wherein encoding of any 

2 single block of audio is inhibited if AQM > AQMthresh+ that 

3 block of audio, and wherein AQMthresh+ > AQMthresh- 

1 17 . The method of claim 14 wherein each of the y 

2 blocks of audio has a spectral energy, wherein 



3 wherein each of the y blocks of audio is divided into B fre- 

4 quency bands, wherein b is a variable representing a frequency 

5 band within the B frequency bands, wherein Eq [b] is the spec- 

6 tral energy in a frequency band b of each of the blocks of 

7 audio without coding, wherein E^Eb] is the spectral energy in 

8 the frequency band b of each of the blocks of audio with 

9 coding, and wherein 

b=B 

AQM = X) . 
6=0 



1 18. The method of claim 1 wherein the first block 

2 of audio has energy, wherein step b) is inhibited if Ej/Ej < 

3 EpRE, wherein E-^ is the energy in a first portion of the first 

4 block of audio, wherein Ej is the energy in a second portion 
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5 of the first block of audio, and wherein EpRE is a predeter- 

6 mined reference. 

1 19. The method of claim 18 wherein represents 

2 the energy in a first quarter of the first block of audio and 

3 E2 represents the energy in a last three quarters of the first 

4 block of audio. 

1 20. The method of claim 18 wherein the first block 

2 of audio is sampled to produce S samples, wherein is deter- 

3 mined according to the following equation: 

4 i?, =E^'M 

5=0 

5 wherein Ej is determined according to the following equation: 

s^S 

7 wherein A[s] is the amplitude A of a sample s in the first 

8 block of audio, and wherein d is chosen so as the divide E^ 

9 from E2. 

1 21. The method of claim 20 wherein d is chosen so 

2 that represents the energy in a first quarter of the first 
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3 block of audio and so that E2 represents the energy in a last 

4 three quarters of the first block of audio. 

1 22. The method of claim 1 further comprising the 

2 step of encoding 1 blocks of audio with binary code bits, 

3 wherein n of the binary code bits are arranged to designate a 

4 distribution level of encoding. 

1 23 . The method of claim 22 wherein k groups of 1 

2 blocks of audio in an audio stream are set aside for encoding 

3 by k distributors of the audio, and wherein a predetermined 

4 combination of the n binary code bits of a corresponding group 

5 of 1 blocks of audio is arranged to indicate that the corre- 

6 spending group of 1 blocks of audio has not been encoded by a 

7 distributor. 

1 24. The method of claim 1 further comprising the 

2 step of encoding more than one frequency band within the first 

3 and second blocks of audio with corresponding binary code 

4 bits, wherein each frequency band is encoded by a different 

5 distributor of the audio. 

1 25. The method of claim 1 further comprising the 

2 following steps: 
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3 e) encoding, by use of a primary encoder, a group 

4 of blocks of audio with a synchronization sequence, wherein 

5 the primary encoder leaves a predetermined number of groups of 

6 additional blocks of audio uncoded; and, 

7 f) encoding, by use of either the primary encoder 

8 or a secondary encoder, a first corresponding one of the 

9 groups of blocks of additional audio so as to identify a first 

10 distributor of the audio; and, 

11 g) encoding, by use of a secondary encoder, a 

12 second corresponding one of the groups of additional blocks of 

13 audio so as to identify a second distributor of the audio. 

1 26. A method for encoding a block of audio with a 

2 binary code bit comprising the following steps: 

3 a) selecting a frequency from a frequency spectrum 

4 of the block of audio; 

5 b) selectively amplifying an odd index frequency in 

6 a neighborhood of the selected frequency to be a local maximum 

7 if the block of audio is to be encoded with the binary code 

8 bit having a first value; and, 

9 c) selectively amplifying an even index frequency 

10 in a neighborhood of the selected frequency to be a local 

11 maximum if the block of audio is to be encoded with the binary 

12 code bit having a second value. 
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1 27. The method of claim 26 wherein the block of 

2 audio is a first block of audio, wherein the binary code bit 

3 is a first binary code bit, wherein the frequency selected in 

4 step a) is a first frequency, and wherein the method further 

5 comprises the following steps: 

6 d) selecting a second frequency from a frequency 

7 spectrum of a second block of audio, wherein the second fre- 

8 quency bears a predetermined offset relationship to the first 

9 frequency; 

10 e) selectively amplifying an odd index frequency in 

11 a neighborhood of the second frequency to be a local maximum 

12 if the second block of audio is to be encoded with a second 

13 binary code bit having one of the first and second values; 

14 and, 

15 f) selectively amplifying an even index frequency 

16 in a neighborhood of the second frequency to be a local maxi- 

17 mum if the second block of audio is to be encoded with the 

18 second binary code bit having the other of the first and 

19 second values. 
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1 28. A method for encoding blocks of audio with 

2 binary code bits comprising the following steps: 

3 a) determining an audio quality measure AQM for 

4 each block of audio; 

5 b) comparing the AQM corresponding to each block of 

6 audio to AQMthresh' wherein AQMthresh is a predetermined audio 

7 quality measure reference; 

8 c) if AQM < AQMthresh ^ blocks of audio out of y 

9 blocks of audio, encoding the blocks of audio with binary 

10 bits, wherein x and y are corresponding predetermined numbers 

11 of blocks of audio; and, 

12 d) if AQM > AQMthresh the x blocks of audio out 

13 of the y blocks of audio, suspending encoding of the blocks of 

14 audio. 

1 29. The method of claim 28 wherein y is a predeter- 

2 mined number of consecutive blocks of audio. 

1 30. The method of claim 28 wherein encoding of any 

2 single block of audio is inhibited if AQM > AQMthresh+ that 

3 block of audio, and wherein AQMthresh+ > AQM^hresh- 

1 31. The method of claim 28 wherein each of the 

2 blocks of audio has a spectral energy, wherein 
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wherein each of the blocks of audio is divided into B fre- 
quency bands, wherein b is a variable representing a frequency- 
band within the B frequency bands, wherein Eq [b] is the spec- 
tral energy in a frequency band b of each of the blocks of 
audio without coding, wherein Ec[b] is the spectral energy in 
the frequency band b of each of the blocks of audio with 
coding, and wherein 

AQM = X) aqm[b] . 
b=Q 



1 32. The method of claim 31 wherein y is a predeter- 

2 mined number of consecutive blocks of audio. 

1 33. The method of claim 28 wherein step c) com- 

2 prises the steps of increasing the spectral power at one of 

3 first and second frequencies within a corresponding block of 

4 audio so as to render the spectral power at the one of the 

5 first and second frequencies a local maximum, and decreasing 

6 the spectral power at the other of the first and second fre- 

7 quencies so as to render the spectral power at the other of 

8 the first and second frequencies a local minimum. 
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1 34, The method of claim 28 wherein step c) com- 

2 prises the step of selectively changing a phase relationship 

3 between first and second frequencies, and wherein the first 

4 and second frequencies are within a block of audio. 

1 35. The method of claim 28 wherein step c) com- 

2 prises the steps of swapping a spectral amplitude at a first 

3 frequency with a spectral amplitude at a second frequency, 

4 wherein the first frequency is within an encoded block of 

5 audio, and wherein the second frequency has a maximum ampli- 

6 tude in a frequency neighborhood of the first frequency. 

1 3 6. A method of encoding a block of audio with a 

2 binary code bit, wherein the block of audio has an energy, and 

3 wherein the method comprises the following steps: 

4 a) determining a ratio Ei/Ej, wherein is the 

5 energy in a first portion of the block of audio, and wherein 

6 is the energy in a second portion of the block of audio; 

7 b) modulating the block of audio with the binary 

8 code bit if E1/E2 > EpR^, wherein EpRE is a predetermined refer- 

9 ence ; and 

10 c) not modulating the block of audio with the 

11 binary code bit if Ei/Ej < EpRg. 
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1 37. The method of claim 36 wherein E-^ is the energy 

2 in a first quarter of the block of audio, and wherein E2 is 

3 the energy in a last three quarters of the block of audio. 

1 38. The method of claim 36 wherein the block of 

2 audio is sampled to produce S samples, wherein is deter- 

3 mined according to the following equation: 

s=d 

5=0 

5 wherein Eg is determined according to the following equation: 

s=S 

6 ^2 = E 

s=d+\ 

7 wherein A[s] is the amplitude A of a sample s, and wherein d 

8 is chosen so as the divide E^ from Eg. 

1 39. The method of claim 38 wherein d is chosen so 

2 that El is the energy in a first quarter of the block of audio 

3 and so that E2 is the energy in a last three quarters of the 

4 block of audio. 
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1 40. The method of claim 36 wherein step b) com- 

2 prises the steps of increasing the spectral power at one of 

3 first and second, frequencies within the block of audio so as 

4 to render the spectral power at the one of the first and 

5 second frequencies a local maximum, and decreasing the spec- 

6 tral power at the other of the first and second frequencies 

7 within the block of audio so as to render the spectral power 

8 at the other of the first and second frequencies a local 

9 minimum, 

1 41. The method of claim 36 wherein step b) com- 

2 prises the step of selectively changing a phase relationship 

3 between first and second frequencies, and wherein the first 

4 and second frequencies are within the block of audio. 

1 42. The method of claim 36 wherein step b) com- 

2 prises the steps of swapping a spectral amplitude at a first 

3 frequency with a spectral amplitude at a second frequency, 

4 wherein the first frequency is within the block of audio, and 

5 wherein the second frequency has a maximum amplitude in a 

6 frequency neighborhood of the first frequency. 
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1 43 . A method of encoding blocks of audio with 

2 binary code bits comprising the following steps: 

3 a) encoding each of the blocks of audio with a 

4 binary bit by modulating the audio within the corresponding 

5 block of audio at selected first and second frequencies, 

6 wherein the selected first and second frequencies are hopped 

7 from block to block; and, 

8 b) executing step a) so as to indicate first and 

9 second levels of distribution of the audio. 



1 44. The method of claim 43 wherein step a) com- 

2 prises the step of encoding 1 blocks of audio with binary code 

3 bits such that n of the binary code bits are arranged to 

4 designate a distribution level of encoding. 

1 45. The method of claim 44 wherein k groups of the 

2 1 blocks of audio in an audio stream are set aside for encod- 

3 ing by k distributors of the audio, and wherein a predeter- 

4 mined combination of the n binary code bits of a corresponding 

5 group of 1 blocks of audio is arranged to indicate that the 

6 corresponding group of 1 blocks of audio has not been encoded 

7 by a distributor. 
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1 46. The method of claim 43 wherein step a) com- 

2 prises the step of encoding more than one frequency band 

3 within each block of audio with corresponding binary code 

4 bits, and wherein each frequency band is encoded by a differ- 

5 ent distributor of the audio. 

1 47. The method of claim 43 further comprising the 

2 following steps: 

3 e) encoding, by use of a primary encoder, a first 

4 group of blocks of audio with a synchronization sequence, 

5 wherein the primary encoder leaves a at least second and third 

6 groups of audio blocks uncoded; 

7 f) encoding a second group of blocks of audio so as 

8 to identify a first distributor of the audio; and, 

9 g) encoding a third group of blocks of audio so as 
10 to identify a second distributor of the audio. 

1 48. The method of claim 47 wherein the first and 

2 second groups of blocks of audio are the same group of blocks 

3 of audio. 

1 49. The method of claim 47 wherein the second group 

2 of blocks of audio is encoded by the primary encoder. 
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1 50. The method of claim 43 wherein step a) corn- 

2 prises the following steps: 

3 increasing the spectral power at one of the first 

4 and second frequencies of each block of audio so as to render 

5 the spectral power at the one of the first and second frequen- 

6 cies a local maximum; and, 

7 decreasing the spectral power at the other of the 

8 first and second frequencies of each block of audio so as to 

9 render the spectral power at the other of the first and second 
10 frequencies a local minimum. 

1 51. The method of claim 43 wherein step a) com- 

2 prises the step of selectively changing a phase relationship 

3 between the first and second frequencies in each of the blocks 

4 of audio. 

1 52. The method of claim 43 wherein step a) com- 

2 prises the following step swapping, in each of the blocks of 

3 audio, a spectral amplitude of at least one of the first and 

4 second frequencies with a spectral amplitude of a frequency 

5 having a maximum amplitude in a frequency neighborhood of the 

6 at least one of the first and second frequencies. 
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1 53. The method of claim 43 wherein the hopping is 

2 performed in accordance with a predetermined frequency hop 

3 sequence . 

1 54. The method of claim 43 wherein the hopping is 

2 based on low frequency maxima. 

1 55. The method of claim 43 wherein a synchroniza- 

2 tion block is added to the signal, and wherein the synchroni- 

3 zation block is characterized by a triple tone portion. 

1 56. A method for decoding first and second blocks 

2 of audio in order to recover corresponding first and second 

3 binary code bits therefrom comprising the following steps: 

4 a) detecting first and second frequencies from a 

5 frequency spectrum of the first block of audio; 

6 b) demodulating the first and second frequencies in 

7 order to recover to the first binary code bit; 

8 c) detecting third and fourth frequencies from a 

9 frequency spectrum of the second block of audio, wherein the 

10 third and fourth frequencies bear a predetermined offset 

11 relationship to the first and second frequencies; and, 

12 d) demodulating the third and fourth frequencies in 

13 order to recover the second binary code bit. 
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1 57. The method of claim 56 wherein: 

2 step b) comprises the step of demodulating the first 

3 and second frequencies to produce a binary code bit having a 

4 value dependent upon which of the first and second frequencies 

5 is a local maximum and which of the first and second frequen- 

6 cies is a local minimum; and, 

7 step d) comprises the step demodulating the third 

8 and fourth frequencies to produce a binary code bit having a 

9 value dependent upon which of the third and fourth frequencies 

10 is a local maximum and which of the third and fourth frequen- 

11 cies is a local minimum. 

1 58. The method of claim 56 wherein: 

2 step b) comprises the step of demodulating the first 

3 and second frequencies depending upon a phase relationship 

4 between the first and second frequencies; and, 

5 step d) comprises the step of demodulating the third 

6 and fourth frequencies depending upon a phase relationship 

7 between the third and fourth frequencies. 

1 59. The method of claim 56 wherein: 

2 step b) comprises the step of demodulating the first 

3 and second frequencies based upon a swapping of a spectral 

4 amplitude of at least one of the first and second frequencies 

5 with a spectral amplitude of a frequency having a maximum 
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6 amplitude in a frequency neighborhood of the least one of the 

7 first and second frequencies; and, 

8 step d) comprises the step of demodulating the third 

9 and fourth frequencies based upon a swapping of a spectral 

10 amplitude of at least one of the third and fourth frequencies 

11 with a spectral amplitude of a frequency having a maximum 

12 amplitude in a frequency neighborhood of the least one of the 

13 third and fourth frequencies. 

1 60. The method of claim 56 wherein the predeter- 

2 mined offset relationship is determined by a frequency hop 

3 sequence. 

1 61. The method of claim 56 wherein the predeter- 

2 mined offset relationship is determined by frequency hopping 

3 based on a low frequency maximum. 

1 62 . The method of claim 56 wherein the predeter- 

2 mined offset relationship is determined in accordance with a 

3 reference frequency, a frequency hop sequence, and a predeter- 

4 mined shift index. 

1 63 . The method of claim 62 wherein the first and 

2 second frequencies are determined according to the following 

3 equations: 



-82- 



wo 01/31816 



PCT/USOO/03829 



4 and 

5 wherein Ij and Iq are indexes corresponding to the first and 

6 second frequencies respectively, wherein Igj^ is an index corre- 

7 spending to the reference frequency, wherein Hg is a component 

8 of the frequency hop sequence, and wherein Ighift is the prede- 

9 termined shift index. 

1 64. The method of claim 56 wherein the predeter- 

2 rained offset relationship is determined according to a refer- 

3 ence frequency, a low frequency maximum, and a predetermined 

4 shift index. 

1 65 . The method of claim 64 wherein the first and 

2 second frequencies are determined according to the following 

3 equations : 

A = hk - U - 

4 and 
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5 wherein and Iq are indexes corresponding to the first and 

6 second frequencies respectively, wherein Igj^ is an index corre- 

7 spending to the reference frequency, wherein I^^^ is an index 

8 corresponding to a frequency at which the spectral power of 

9 the first block of audio is a maximum in a low frequency band 

10 of the first block of audio, and wherein Ighift is the predeter- 

11 mined shift index. 

1 66. The method of claim 56 further comprising the 

2 step of detecting a synchronization message from a plurality 

3 of blocks of audio, and wherein the synchronization message is 

4 characterized by a triple tone portion. 

1 67. The method of claim 56 further comprising the 

2 following steps: 

3 decoding 1 blocks of audio to recover a plurality of 

4 binary code bits; and, 

5 decoding n of the plurality of binary code bits in 

6 order to determine a distribution level of the audio. 
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1 68. The method of claim 67 wherein a particular 

2 combination of the n binary code bits indicates that a corre- 

3 spending group of 1 blocks of audio has not been encoded by a 

4 distributor of the audio. 

1 69. The method of claim 56 further comprising the 

2 step of decoding more than one frequency band within the first 

3 and second blocks of audio in order to recover corresponding 

4 pairs of first and second binary code bits, wherein each 

5 frequency band is encoded by a different distributor of the 

6 audio . 

1 70. The method of claim 56 further comprising the 

2 following steps: 

3 e) decoding a first group of blocks of audio so as 

4 to identify a first distributor of the audio; and, 

5 f) decoding a second group of blocks of audio so as 

6 to identify a second distributor of the audio. 
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1 71. A method of decoding blocks of audio encoded 

2 with binary code bits comprising the following steps: 

3 a) decoding each of the blocks of audio in order to 

4 recover a corresponding binary bit by demodulating the audio 

5 within the corresponding block of audio at selected first and 

6 second frequencies, wherein the selected first and second 

7 frequencies are hopped from block to block; and, 

8 b) executing step a) so as identify first and 

9 second distributors of the audio. 

1 72. The method of claim 71 wherein step a) com- 

2 prises the step of decoding n of the binary code bits in order 

3 to determine a distribution level of encoding. 

1 73 . The method of claim 72 wherein k groups of 

2 blocks of audio in an audio stream are set aside for encoding 

3 by k distributors of the audio, and wherein step a) comprises 

4 the step of decoding a predetermined combination of the n 

5 binary code bits to determine that a corresponding group of 

6 blocks of audio has not been encoded by a distributor. 
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1 74. The method of claim 71 wherein step a) com- 

2 prises the step of decoding more than one frequency band 

3 within each block of audio in order to recover a corresponding 

4 binary code bit from each frequency band and to identify a 

5 different distributor of the audio associated with each fre- 

6 quency band. 

1 75. The method of claim 71 further comprising the 

2 following steps: 

3 e) decoding a first group of blocks of audio to 

4 recover a synchronization sequence; 

5 f) decoding a second group of blocks of audio so as 

6 to identify a first distributor of the audio; and, 

7 g) decoding a third group of blocks of audio so as 

8 to identify a second distributor of the audio. 

1 76. The method of claim 75 wherein the first and 

2 second groups of blocks of audio are the same group of blocks 

3 of audio. 

1 77. The method of claim 71 wherein step a) com- 

2 prises the step of demodulating the first and second frequen- 

3 cies to recover a binary code bit having a value dependent 

4 upon which of the first and second frequencies is a local 
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5 maximum and which of the first and second frequencies is a 

6 local minimum. 

1 78. The method of claim 71 wherein step a) com- 

2 prises the step of demodulating the first and second frequen- 

3 cies to recover a binary code bit having a value dependent 

4 upon a phase relationship between the first and second fre- 

5 quencies. 

1 79. The method of claim 71 wherein step a) com- 

2 prises the step of demodulating the first and second frequen- 
3. cies based upon a swapping of a spectral amplitude of at least 

4 one of the first and second frequencies with a spectral ampli- 

5 tude of a frequency having a maximum amplitude in a frequency 

6 neighborhood of the least one of the first and second frequen- 

7 cies in order to recover a single binary code bit . 

1 80. The method of claim 71 wherein the hopping is 

2 performed in accordance with a predetermined frequency hop 

3 sequence . 

1 81. The method of claim 71 wherein the hopping is 

2 based on low frequency maxima. 
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1 82. The method of claim 71 further comprising the 

2 step of decoding a synchronization message characterized by a 

3 triple tone portion. 

1 83 . A method of decoding a block of audio in order 

2 to recover a binary code bit therefrom comprising the follow- 

3 ing steps: 

4 a) detecting a frequency having an amplitude maxi- 

5 mum within a selected frequency neighborhood of the block of 

6 audio; 

7 b) if the frequency detected in step a) corresponds 

8 to an odd frequency index, decoding the frequency as a binary 

9 code bit having a first value; and, 

10 c) if the frequency detected in step a) corresponds 

11 to an even frequency index, decoding the frequency as a binary 

12 code bit having a second value. 

1 84. The method of claim 83 wherein a frequency 

2 index bears a predetermined relationship to a corresponding 

3 frequency. 

1 85. The method of claim 84 wherein the predeter- 

2 mined relationship is given by the following equation: 
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I = (-)•/ 
J 

1 86. The method of claim 85 wherein k = 255 and 

2 wherein j = 24 . 
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